A Robot that Teaches Itself to Clear Clutter

Authors

  • Abdeslam Boularias (Department of Computer Science, Rutgers University, New Brunswick, NJ 08901, USA)
  • J. Andrew Bagnell (The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA)
  • Anthony Stentz (The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA)

This work was done while all the authors were at the National Robotics Engineering Center of Carnegie Mellon University.

Abstract

Outdoor robots, such as search-and-rescue robots and planetary rovers, often need to grasp and push irregularly shaped objects such as debris and rocks. While most object manipulation techniques require mechanical or geometric models of the objects, objects encountered in outdoor environments typically have no known models. To grasp unknown novel objects, data-driven approaches are becoming increasingly popular. These approaches learn, from examples, statistical models that predict the success of grasping or pushing actions. Perhaps the biggest drawback of statistical models is their inherent inaccuracy: the predicted values are always subject to error unless the objects used during testing are identical to some of the objects used for training. To solve this issue, we introduce in this article two self-supervisory learning techniques that allow the robot to adapt to new objects by correcting the predicted values online. Using these techniques, we show how a robot can efficiently learn, on the fly and in real time, to push and grasp novel objects. We test both techniques on the task of autonomously clearing piles of natural and man-made objects.

The first technique is used for piles that can be cleared with grasping actions alone. The outcomes of the grasping actions are modeled as a Gaussian Process, and an entropy-guided method is used to learn where the best grasp is most likely to be found.

The second technique is used for tight piles in which grasping actions alone are not effective. The robot first needs to push obstacles away in a particular way that helps grasping, by creating empty space around them in which to insert the robot’s fingers. We present a reinforcement learning approach that we use for selecting the best sequence of pushing and grasping actions to execute in order to clear a given pile. We also present a perception technique that we used, along with the online adaptation techniques, to build a fully autonomous system. The integrated system is tested on clearing various piles of objects using a Barrett WAM robot.
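
The first technique in the abstract (grasp outcomes modeled as a Gaussian Process, with an entropy-guided search for the best grasp) can be illustrated with a rough sketch. This is not the authors' implementation: the kernel, its hyperparameters, the 5×5 candidate grid, the two made-up outcomes, and all function names (`rbf_kernel`, `gp_posterior`, `prob_of_best`, `entropy`) are assumptions chosen for illustration. The sketch fits a GP to a few tried grasps, estimates by sampling the probability that each candidate location is the best one, and reports the entropy of that belief; the abstract does not specify the actual selection rule, so the sketch simply picks the most probable candidate.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2, variance=1.0):
    """Squared-exponential kernel between two sets of 2-D points (assumed choice)."""
    sq_dists = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return variance * np.exp(-0.5 * sq_dists / length_scale**2)

def gp_posterior(x_train, y_train, x_cand, noise=1e-2):
    """Posterior mean and covariance of a zero-mean GP at the candidate points."""
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_tc = rbf_kernel(x_train, x_cand)
    k_cc = rbf_kernel(x_cand, x_cand)
    # Solve once for both K^-1 y (first column) and K^-1 K_tc (remaining columns).
    solved = np.linalg.solve(k_tt, np.column_stack([y_train, k_tc]))
    mean = k_tc.T @ solved[:, 0]
    cov = k_cc - k_tc.T @ solved[:, 1:]
    return mean, 0.5 * (cov + cov.T)  # symmetrize against round-off

def prob_of_best(mean, cov, n_samples=2000, seed=0):
    """Monte Carlo estimate of P(candidate i has the highest grasp score)."""
    rng = np.random.default_rng(seed)
    jitter = 1e-8 * np.eye(len(mean))
    samples = rng.multivariate_normal(mean, cov + jitter, size=n_samples)
    winners = np.argmax(samples, axis=1)
    return np.bincount(winners, minlength=len(mean)) / n_samples

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    nonzero = p[p > 0]
    return float(-np.sum(nonzero * np.log(nonzero)))

# Hypothetical data: a 5x5 grid of candidate grasp locations and two tried grasps,
# one successful (score 1.0) and one failed (score 0.0).
grid = np.linspace(0.0, 1.0, 5)
candidates = np.array([[u, v] for u in grid for v in grid])
tried = candidates[[0, 24]]
outcomes = np.array([1.0, 0.0])

mean, cov = gp_posterior(tried, outcomes, candidates)
p_best = prob_of_best(mean, cov)
belief_entropy = entropy(p_best)               # lower means less uncertainty about the best grasp
next_grasp = candidates[np.argmax(p_best)]     # simple stand-in for the selection rule
```

In a self-supervised loop, the robot would execute `next_grasp`, append the observed outcome to `tried` and `outcomes`, and refit; the entropy of `p_best` gives one possible measure of how uncertain the robot still is about where the best grasp lies.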

Similar resources

Workspace Boundary Avoidance in Robot Teaching by Demonstration Using Fuzzy Impedance Control

The present paper investigates an intuitive way of robot path planning, called robot teaching by demonstration. In this method, an operator holds the robot end-effector and moves it through a number of positions and orientations in order to teach it a desired task. The presented control architecture applies impedance control in such a way that the end-effector follows the operator’s hand with d...

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide a mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

Physics-Based Grasp Planning Through Clutter

We propose a planning method for grasping in cluttered environments, where the robot can make simultaneous contact with multiple objects, manipulating them in a deliberate and controlled fashion. This enables the robot to reach for and grasp the target while simultaneously contacting and moving aside obstacles in order to clear a desired path. We use a physics-based analysis of pushing to comput...

Visual Tracking using Learning Histogram of Oriented Gradients by SVM on Mobile Robot

The intelligence of a mobile robot is highly dependent on its vision. The main objective of an intelligent mobile robot is the ability to perform online image processing, object detection, and especially visual tracking, which is a complex task in stochastic environments. Tracking algorithms suffer from sequence challenges such as illumination variation, occlusion, and background clutter, so an a...

Radar Signal Detection in K-distributed Clutter by Pade Approximation

In this paper, two suboptimum detectors are proposed for coherent radar signal detection in K-distributed clutter. Assuming certain values for several initial moments of clutter amplitude, the characteristic function of the clutter amplitude is approximated by a limited series. Using the Pade approximation, it is then converted to a rational fraction. Thus, the pdf of the clutter amplitude is o...


Publication date: 2016